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(54) SOUND ENCODER AND SOUND DECODER 

(57) An excitation vector generator comprises a 
pulse vector generating section having N channels 
(Ns1) for generating puise vectors, a storing section for 
storing M (Msl)kinds of dispersion patterns every 
channel in accordance with N channels, a selecting sec- 
tion for selectively taking out a dispersion pattern from 
the storing section every channel, a dispersion section 



for performing a superimposing calculation of the 
extracted dispersion pattern and the generated pulse 
vectors every channel so as to generate N dispersion 
vectors, excitation vector generating section for gener- 
ating an excitation vector from N dispersion vectors 
generated. 
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Description 

Technical Field 

5 [0001] The present invention relates to a speech coder for efficiently coding speech information and a speech decoder 
for efficiently decoding the same. 

Background Art 

10 [0002] A speech coding technique for efficiently coding and decoding speech information has been developed in 
recent years In Code Excited Linear Prediction: "High Quality Speech at Low Bit Rate", M. R. Schroeder, Proc. 
ICASSP'85. pp. 937-940, there is described a speech coder of a CELP type, which is on the basis of such a speech 
coding technique. 

[0003] In this speech coder, a linear prediction for an input speech is carried out in every frame, which is divided at a 
is fixed time. A prediction residual (excitation signal) is obtained by the linear prediction for each frame. Then, the predic- 
tion residual is coded using an adaptive codebook in which a previous excitation signal is stored and a random code- 
book in which a plurality of random code vectors is stored. \ 
[0004] FIG. 1 shows a functional block of a conventional CELP type speech coder. 

[0005] A speech signal 1 1 input to the CELP type speech coder is subjected to a linear prediction analysis in a linear 
20 prediction analyzing section 12. A linear predictive coefficients can be obtained by the linear prediction analysis. The 
linear predictive coefficients are parameters indicating an spectrum envelop of the speech signal 1 1 . The linear predic- 
tive coefficients obtained in the linear prediction analyzing section 12 are quantized by a linear predictive coefficient 
coding section 13, and the quantized linear predictive coefficients are sent to a linear predictive coefficient decoding 
section 1 4. Note that an index obtained by this quantization is output to a code outputting section 24 as a linear predic- 
ts tive code. The linear predictive coefficient decoding section 1 4 decodes the linear predictive coefficients quantized by 
the linear predictive coefficient coding section 1 3 so as to obtain coefficients of a synthetic filter. The linear predictive 
coefficient decoding section 14 outputs these coefficients to a synthetic filter 15. 

[0006] An adaptive codebook 1 7 is one, which outputs a plurality of candidates of adaptive codevectors, and which 
comprises a buffer for storing excitation signals corresponding to previous several frames. The adaptive codevectors 
30 are time series vectors, which express periodic components in the input speech. 

[0007] A random codebook 1 8 is one, which stores a plurality of candidates of random codevectors. The random code 
vectors are time series vectors, which express non-periodic components in the input speech. 

[0008] In an adaptive code gain weighting section 19 and a random code gain weighting section 20, the candidate 
vectors output from the adaptive codebook 17 and the random codebook 18 are multiplied by an adaptive code gain 
35 read from a weight codebook 21 and a random code gain, respectively, and the resultants are output to an adding sec- 
tion 22. 

[0009] The weighting codebook stores a plurality of adaptive codebook gains by which the adaptive codevector is mul- 
tiplied and a plurality of random codebook gains by which the random codevectors are multiplied. 
[0010] The adding section 22 adds the adaptive code vector candidates and the random code vector candidates, 

40 which are weighted in the adaptive code gain weighting section 19 and the random code gain weighting section 20, 
respectively. Then, the adding section 22 generates excitation vectors so as to be output to the synthetic filter 15. 
[001 1 ] The synthetic filter 1 5 is an all-pole filter. The coefficients of the synthetic filter are obtained by the linear pre- 
dictive coefficient decoding section 14. The synthetic filter 15 has a function of synthesizing input excitation vector in 
order to produce synthetic speech and outputting that synthetic speech to a distortion calculator 16. 

45 [001 2] A distortion calculator 1 6 calculates a distortion between the synthetic speech, which is the output of the syn- 
thetic filter 15, and the input speech 1 1 , and outputs the obtained distortion value to a code index specifying section 23. 
The code index specifying section 23 specifies three kinds of codebook indicies (index of adaptive codebook, index of 
random codebook, index of weight codebook) so as to minimize the distortion calculated by the distortion calculation 
section 16. The three kinds of codebook indicies specified by the code index specifying section 23 are output to a code 

so outputting section 24. The code outputting section 24 outputs the index of linear predictive codebook obtained by the 
linear predictive coefficient coding section 13 and the index of adaptive codebook, the index of random code, the index 
of weight codebook, which have been specified by the code index specifying section 23, to a transmission path at one 
time. 

[0013] FIG. 2 shows a functional block of a CELP speech decoder, which decodes the speech signal coded by the 
55 aforementioned coder. In this speech decoder apparatus, a code input section 31 receives codes sent from the speech 
coder (FIG. 1). The received codes are disassembled into the index of the linear predictive codebook, the index of adap- 
tive codebook, the index of random codebook, and the index of weight codebook Then, the indicies obtained by the 
above disassemble are output to a linear predictive coefficient decoding section 32, an adaptive codebook 33, a random 
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codebook 34, and a weight codebook 35, respectively. 

[0014] Next, the linear predictive coefficient decoding section 32 decodes the linear predictive code number obtained 
by the code input section 31 so as to obtain coefficients of the synthetic filter, and outputs those coefficients to a syn- 
thetic filter 39. TJien, an adaptive codevector corresponcfing to the index of adaptive codebook is read from adaptive 
codebook, and a random codevector corresponding to the index of random codebook is read from the random code- 
book. Moreover, an adaptive codebook gain and a random codebook gain corresponding to the index of weight code- 
book are read from the weight codebook. Then, in an adaptive codevector weighting section 36, the adaptive 
codevector is multiplied by the adaptive codebook gain, and the resultant is sent to an adding section 38. Similarly, in a 
random codevector weighting section 37, the random codevector is multiplied by the random codebook gain, and the 
resultant is sent to the adding section 38. 

[0015] The adding section 38 adds the above two codevectors and generates an excitation vector. Then, the gener- 
ated excitation vector is sent to the adaptive codebook 33 to update the buffer or the synthetic filter 39 to excite the filter. 
The synthetic f flter 39, composed with the linear predictive coeffcients which are output from linear predictive coefficient 
decoding section 32, is exerted by the excitation vector obtained by the adding section 38. and reproduces a synthetic 
speech. 

[001 6] Note that, in the distortion calculator 1 6 of the CELP speech coder, distortion E is generally calculated by the 
following expression (1): 



20 



E = |l v - (gaHP+gcHC) \\' 

where v: an input speech signal (vector), 

H: an impulse response convolution matrix for a synthetic filter 



(1) 



30 



• h{G) 0 




... 0 


0 ' 


h(l) A(0) 


0 


... 0 


0 


A(2) A(l) 


A(0) 


0 0 


0 






"-. 0 


0 






•• HO) 


0 


A(L-l) •• 




... 


A(0) 



wherein h is an impulse response of a synthetic filter, L is a frame length, 
p: an adaptive codevector, 
c: a random codevector, 
ga: an adaptive codebook gain 
40 gc: a random codebook gain 

[0017] Here, in order to minimize distortion E of expression (1), the distortion is calculated by a closed loop with 
respective to all combinations of the adaptive code number, the random code number, the weight code number, it is 
necessary to specify each code number. 

45 [0018] However, if the closed loop search is performed with respect to expression (1), an amount of calculation 
processing becomes too large. For this reason, generally, first of all, the index of adaptive codebook is specified by vec- 
tor quantization using the adaptive codebook. Next the index of random coodbook is specified by vector quantization 
using the random codebook. Finally, the index of weight codebook is specified by vector quantization using the weight 
codebook. Here, the following will specifically explain the vector quantization processing using the random codebook. 

so [001 9] In a case where the index of adaptive codebook or the adaptive codebook gain are previously or temporarily 
determined, the expression for evaluating distortion shown in expression (1 ) is changed to the following expression (2) : 

Ec = || x-gcHC || 2 (2) 

55 where vector x in expression (2) is random excitation target vector for specifying a random code number which is 
obtained by the following equation (3) using the previously or temporarily specified adaptive codevector and adaptive 
codebook gain. 
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x - v^gaHP ( 3 ) 

where ga: an adaptive codebook gain, 
v: a speech signal (vector), 

H: an impulse response convolution matrix for a synthetic f ilter, 
p: an adaptive codevector. 

[0020] For specifying the random codebook gain gc after specifying the index of random codebook, it can be assumed 
that gc in the expression (2) can be set to an arbitrary value. For this reason, it is known that a quantization processing 
for specifying the index of the random codebook minimizing the expression (2) can be replaced with the determination 
of the index of the random codebook vector maximizing the following fractional expression (4): 

(* f "c) 2 (4) 
|Hci 2 

[0021 ] In other words, in a case where the index of adaptive codebook and the adaptive codebook gain are previously 
or temporarily determined, vector quantization processing for random excitation becomes processing for specifying the 
index of the random codebook maximizing fractional expression (4) calculated by the distortion calculator 16. 
[0022] In the CELP coder/decoder in the early stages, one that stores kinds of random sequences corresponding to 
the number of bits allocated in the memory was used as a random codebook. However, there was a problem in which 
a massive amount of memory capacity was required and the amount of calculation processing for calculating distortion 
of expression (4) with respect to each random codevector was greatly increased. 

[0023] As one of methods for solving the above problem, there is a CELP speech coder/decoder using an algebraic 
excitation vector generator for generating an excitation vector algebraically as described in "8KBIT/S ACELP CODING 
OF SPEECH WITH 10 MS SPEECH-FRAME: A CANDIDATE FOR CCITT STANDARDIZATION'': R. Salami. C. 
Laflamme. J-P. Adoul. ICASSP*94. pp.ll-97~ll-100. 1994. 

[0024] However, in the above CELP speech coder/decoder using an algebraic excitation vector generator, random 
excitation (target vector for specifying an index of random codebook) obtained by equation (3) is approximately 
expressed by a few signed pulses. For this reason, there is a limitation in improvement of speech quality. This is obvious 
from an actual investigation of an element for random excitation x of expression (3) wherein there are few cases in 
which random excitations are composed oily of a few signed pulses. 

Disclosure of Invention 

[0025] An object of the present invention is to provide an excitation vector generator, which is capable of generating 
an excitation vector whose shape has a statistically high similarity to the shape of a random excitation obtained by ana- 
lyzing an input speech signal. . 
[0026] Also, an object of the present invention is to provide a CELP speech coder/decoder, a speech signal commu- 
nication system, a speech signal recording system, which use the above excitation vector generator as a random code- 
book so as to obtain a synthetic speech having a higher quality than that of the case in which an algebraic excitation 
vector generator is used as a random codebook. 

[0027] A first aspect of the present invention is to provide an excitation vector generator comprising a pulse vector 
generating section having N channels (N*t) for generating pulse vectors each having a signed unit pulse provided to 
one element on a vector axis, a storing and selecting section having a function of storing M (M*1)kinds of dispersion 
patterns every channel and a function of selecting a certain kind of dispersion pattern from M kinds of dispersion pat- 
terns stored, a pulse vector dispersion section having a function of convolving the dispersion pattern selected from the 
dispersion pattern storing and selecting section to the signed pulse vector output from the pulse vector generator so as 
to generator N dispersed vectors, and a dispersed vector adding section having a function of adding N dispersed vec- 
tors generated by the pulse vector dispersion section so as to generate an excitation vector. The function for algebrai- 
cally generating (Ns1) pulse vectors is provided to the pulse vector generator, and the dispersion pattern storing and 
selecting section stores the dispersion patterns obtained by pre-training the shape (characteristic) of the actual vector, 
whereby making it possible to generate the excitation vector, which is well similar to the shape of the actual excitation 
vector as compared with the conventional algebraic excitation generator. 

[0028] Moreover, the second aspect of the present invention is to provide a CELP speech coder/decoder using the 
above excitation vector generator as the random codebook, which is capable of generating the excitation vector being 
closer to the actual shape than the case of the conventional speech coder/decoder using the algebraic excitation gen- 
erator as the random codebook. Therefore, there can be obtained the speech coder/decoder, speech signal com muni- 
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cation system, and speech signal recording system, which can output the synthetic speech having a higher quality. 
Brief Description of Drawings 
5 [0029] 

FIG. 1 is a functional block diagram of a conventional CELP speech coder; 
FIG. 2 is a functional block diagram of a conventional CELP speech decoder; 

FIG. 3 is a functional block diagram of an excitation vector generator according to a first embodiment of the present 
w invention; 

FIG. 4 is a functional block diagram of a CELP speech coder according to a second embodiment of the present 
invention; 

FIG. 5 is a functional block diagram of a CELP speech decoder according to the second embodiment of the present 
invention; 

is FIG. 6 is a functional block diagram of a CELP speech coder according to a third embodiment of the present inven- 
tion; 

FIG. 7 is a functional block diagram of a CELP speech coder according to a fourth embodiment of the present 
invention; 

FIG. 8 is a functional block diagram of a CELP speech coder according to a fifth embodiment of the present inven- 
20 tion; 

FIG. 9 is a functional block diagram of a vector quantization function according to the fifth embodiment of the 
present invention; 

FIG. 10 is a view explaining an algorithm for a target extraction according to the fifth embodiment of the present 
invention; 

25 FIG. 1 1 is a functional block diagram of a predictive quantization according to the fifth embodiment of the present 
invention; 

FIG. 12 is a functional block diagram of a predictive quantization according to a sixth embodiment of the present 
invention; 

FIG. 1 3 is a functional block diagram of a CELP speech coder according to a seventh embodiment of the present 
30 invention; and 

FIG. 1 4 is a functional block diagram of a distortion calculator according to the seventh embodiment of the present 
invention. 

Best Mode for Carrying Out the Invention 

35 

[0030] Embodiments will now be described with reference to the accompanying drawings. 
(First embodiment) 

40 [0031] FIG. 3 is a functional block diagram of an excitation vector generator according to a first embodiment of the 
present invention. 

[0032] The excitation vector generator comprises a pulse vector generator 101 having a plurality of channels, a dis- 
persion pattern storing and selecting section 102 having dispersion pattern storing sections and switches, a pulse vec- 
tor dispersion section 103 for dispersing the pulse vectors, and a dispersed vector adding section 104 for adding the 
45 dispersed pulse vectors for the plurality of channels. 

[0033] The pulse vector generator 101 comprises N (a case of N=3 will be explained in this embodiment) channels 
for generating vectors (hereinafter referred to as pulse vectors) each having a signed unit pulse with provided to one 
element on a vector axis. 

[0034] The dispersion pattern storing and selecting section 1 02 comprises storing sections M1 to M3 for storing M (a 
so case of M=2 will be explained in this embodiment) kinds of dispersion patterns for each channel and switches SW1 to 
SW2 for selecting one kind of dispersion pattern from M kinds of dispersion patterns stored in the respective storing 
sections M1 to M3. 

[0035] The pulse vector dispersion section 1 03 performs convolution of the pulse vectors output from the pulse vector 
generator 101 and the dispersion patterns output from the dispersion pattern storing and selecting section 102 in every 
55 channel so as to generate N dispersed vectors. 

[0036] The dispersed vector adding section 1 04 adds up N dispersed vectors generated by the pulse vector disper- 
sion section 103, thereby generating an excitation vector 105. 

[0037] Note that, in this embodiment, a case in which the pulse vector generator 101 algebraically generates N (N=3) 
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pulse vectors in accordance 
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with the rule described in Table 1 set forth below will be explained. 
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l0 038] An operaton of the ^^^^02^ at^pSern by one Kind by one from 
10039] ™e di ^ ereion ^SL a ^ tne dispersion pattern. In this case, the 

(total number of combinations: M N=8 )- alnoK roi« l «v Generates the signed pulse vectors corresponding to the 



ci(n) i 



(5) 



where n: 0-L-1. 
35 L: dispersion vector length, 
i: channel number. 

j- dispersion pattern number (j=1 ~M ), 

»j » e vector ,ength of * i(m) is 2L " 1 (m: " (L ' 1)-L ' 1 } - "* rt is *" e,e " 

merit. IJj that can specify the value and the other elements are zero, 
di: signed pulse vector for channel i, 
di = ± 6 (n-pi) . n = 0-L-1 . and 
pi: pulse position candidate tor channel i. 

^ wortnr aridino section 1 04 adds up three dispersed vectors generated by the pulse vector dis- 

c(n) - £c/(/i) v ' 

50 



40 



45 



where c: excitation vector, 
ci: dispersed vector. 



[tKW3] ^ above-strucfcred excitation vector generator can generate various e**ation vectors by adding variations 
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to the combinations of the dispersion patterns, which the dispersion pattern storing and selecting section 1 02 selects, 
and the pulse position and polarity in the pulse vector, which the pulse vector generator 101 generates, 
[0044] Then, in the above-structured excitation vector generator, it is possible to allocate bits to two kinds of informa- 
tion having the combinations of dispersion patterns selected by the dispersion pattern storing and selecting section 102 

5 and the combinations of the shapes (the pulse positions and polarities) generated by the pulse vector generator 101 . 
The indices of this excitation vector generator are in a one-to-one correspondence with two kinds of information. Also, 
a training processing is executed based on actual excitation information in advance and the dispersion patterns obtain- 
able as the training result can be stored in the dispersion pattern storing and selecting section 102. 
[0045] Moreover, the above excitation vector generator is used as the excitation information generator of speech 

w coder/decoder to transmit two kinds of indices including the combination index of dispersion patterns selected by the 
dispersion pattern storing and selecting section 102 and the combination index of the configuration (the pulse positions 
and polarities) generated by the pulse vector generator 101 . thereby making it possible to transmit information on ran- 
dom excitation. 

[0046] Also, the use of the above-structured excitation vector generator allows the configuration (characteristic) sim- 
is War to actual excitation information to be generated as compared with the use of algebraic codebook. 

[0047] The above embodiment explained the case in which the dispersion pattern storing and selecting section 102 
stored two kinds of dispersion patterns per one channel. However, the similar function and effect can be obtained in a 
case in which the dispersion patterns other than two kinds are allocated to each channel. 

[0048] Also, the above embodiment explained the case in which the pulse vector generator 101 was based on the 
20 three-channel structure and the pulse generation rule described in Table 1 . However, the similar function and effect can 
be obtained in a case in which the number of channels is different and a case in which the pulse generation rule other 
than Table 1 is used as a pulse generation rule. 

[0049] A speech signal communication system or a speech signal recording system having the above excitation vec- 
tor generator or the speech coder/decoder is structured, thereby obtaining the functions and effects which the above 
25 excitation vector generator has. 

(Second embodiment) 

[0050] FIG. 4 shows a functional block of a CELP speech coder according to the second embodiment, and FIG. 5 

30 shows a functional block of a CELP speech decoder. 

[0051 ] The CELP speech coder according to this embodiment applies the excitation vector generator explained in the 
first embodiment to the random codebook of the CELP speech coder of FIG. 1 . Also, the CELP speech decoder accord- 
ing to this embodiment applies the excitation vector generator explained in the first embodiment to the random code- 
book of the CELP speech decoder of FIG. 2. Therefore, processing other than vector quantization processing for 

35 random excitation is the same as that of the apparatuses of FIGS. 1 and 2. This embodiment will explain the speech 
coder and the speech decoder with particular emphasis on vector quantization processing for random excitation. Also, 
similar to the first embodiment, the generation of pulse vectors are based on Table 1 wherein the number of channels 
N = 3 and the number of dispersion patterns for one channel M = 2. 

[0052] The vector quantization processing for random excitation in the speech coder illustrated in FIG. 4 is one that 
40 specifies two kinds of indices (combination index for dispersion patterns and combination index for pulse positions and 
pulse polarities) so as to maximize reference values in expression (4). 

[0053] In a case where the excitation vector generator illustrated in FIG. 3 is used as a random codebook, combina- 
tion index for dispersion patterns (eight kinds) and combination index for pulse vectors (case considering the polarity: 
1 6384 kinds) are searched by a closed loop. 

45 [0054] For this reason, a dispersion pattern storing and selecting section 215 selects either of two kinds of dispersion 
patterns stored in the dispersion pattern storing and selecting section itself, and outputs the selected dispersion pattern 
to a pulse vector dispersion section 217. Thereafter, a pulse vector generator 216 algebraically generates pulse vectors 
corresponding to the number of channels (three in this embodiment) in accordance with the rule described in Table 1 , 
and outputs the generated pulse vectors to the pulse vector dispersion section 217. 

so [0055] The pulse vector dispersion section 21 7 generates a dispersed vector for each channel by a convolution cal- 
culation. The convolution calculation is performed on the basis of the expression (5) using the dispersion patterns 
selected by the dispersion pattern storing and selecting section 215 and the signed pulses generated by the pulse vec- 
tor generator 216. 

[0056] A dispersion vector adding section 218 adds up the dispersed vectors obtained by the pulse vector dispersion 
55 section 21 7, thereby generating excitation vectors (candidates for random codevectors). 

[0057] Then, a distortion calculator 206 calculates evaluation values according to the expression (4) using the random 
code vector candidate obtained by the dispersed vector adding section 218. The calculation on the basis of the expres- 
sion (4) is carried out with respect to all combinations of the pulse vectors generated based on the rule of Table 1 . Then, 
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amona the calculated values, the combination index for dispersion patterns and the combination index for pulse vectors 
(combination ot the pulse positions and the polarities), which are obtained when the evaluation value by the expression 
(4) becomes maximum and the maximum value are output to a code number specifying section 213. 
foossi Next the^Jispersion pattern storing and selecting section 215 selects the combination for dispersion patterns 
which is different from the previously selected combination for the dispersion patterns. Regarding the cornb.nat.on for 
dispersion patterns newly selected, the calculation of the value of expression (4) is carried out wrth respect to all com- 
binations of the pulse vectors generated by the pulse vector generator 216 based on the rule of Table 1 . Then, among 
the calculated values, the combination index for dispersion patterns and the combination index tor pulse vectors, which 
are obtained when the value of expression (4) becomes maximum and the maximum value are output to the code indi- 
ces soecitving section 213 again. t . _ ._ 
[0059] The above processing is repeated with respect to all combinations (total number of comb.nat.ons is eight in 
this embodiment) selectable from the dispersion patterns stored in the dispersion pattern storing and selecting section 

(00601 The code indices specifying section 213 compares eight maximum values in total calculated by the abortion 
calculator 206 and selects the highest value of all. Then, the code indices specifying section 213 specrf.es two tands of 
combination indices (combination index for dispersion patterns, combination index for pulse vectors) wh.cn are 
obtained when the highest value is generated, and outputs the specified combination indices to a code outputbng sec- 
tion 214 as an index of random codebook. 

[0061] On the other hand, in the speech decoder of FIG. 5. a code inputting section 301 receives codesl ^™*ted 
from the speech coder (FIG. 4). decomposes the received codes into the corresponding index of LPC codebook. the 
index of adaptive codebook. the index of random codebook (composed of two kinds of the combination index for dis- 
persion patterns and combination index for pulse vectors) and the index of weight codebook. Then, the code inputting 
section 301 outputs the decomposed indicies to a linear prediction coefficient decoder 302. an adaptive codebook. a 
random codebook 304. and a weight codebook 305. Note that, in the random code number, that the comb.nat.on index 
for dispersion patterns is output to a dispersion pattern storing and storing section 31 1 and the combination index for 
pulse vectors is output to a pulse vector generator 312. . ~ 

[0062] Then the linear prediction coefficient decoder 302 decodes the linear pred.ctrve code number, obtains the 
coefficients for a synthetic filter 309. and outputs the obtained coefficients to the synthetic filter 309. In the adaptive 
codebook 303 an adaptive codevector corresponding to the index of adaptive codebook is read from. 
10063] In the random codebook 304. the dispersion pattern storing and selecting section 31 1 reads the dtspersion 
patterns corresponding to the combination index for dispersion pulses in every channel, and outputs the resultant to a 
pulse vector dispersion section 313. The pulse vector generator 312 generates the pulse vectors corresponding to the 
combination index for pulse vectors and corresponding to the number of channels, and outputs the resultant to the 
pulse vector dispersion section 313. The pulse vector dispersion section 313 generates a dispersed vector for each 
channel by convolving the dispersion patterns received from the dispersion pattern storing and selecting section 31 1 
on the singed pulses received from the pulse vector generator 312. Then, the generated dispersed vectors are output 
to a dispersion vector adding section 314. The dispersion vector adding section 314 adds up the dispersed vectors of 
the respective channels generated by the pulse vector dispersion section 313. thereby generating a random 
[0064] Then an adaptive codebook gain and a random codebook gain corresponding to the index of weight codebook 
are read from the weight codebook30S. Then, in an adaptive code vector weighting section 306. the adaptive codevec- 
tor is multiplied by the adaptive codebook gain. Similarly in a random code vector weighting section 307, the random 
codevector is multiplied by the random codebook gain. Then, these resultants are output to an adding section 308. 
[0065] The adding section 308 adds up the above two code vectors multiplied by the gains so as to generate an exci- 
tation vector. Then, the adding section 308 outputs the generated excitation vector to the adaptive codebook 303 to 
update a buffer or to the synthetic filter 309 to excite the synthetic filter. 

[00661 The synthetic filter 309 is excited by the excitation vector obtained by the adding section 308, and reproduces 
a synthetic speech 310. Also, the adaptive codebook 303 updates the buffer by the excitation vector received from the 

adding section 308. , . . .. u 

[0067] In this case suppose that the dispersion patterns obtained by pre-training are stored for each channel in the 
i dispersion pattern storing and selecting section of FIGS. 4 and 5 such that a value of cost function becomes smaller 
wherein the cost function is a distortion evaluation expression (7) in which the excitation vector descnbed in expression 
(6) is substituted into c of expression (2). 
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where x: target vector for specifying index of random codebook, 
gc: random codebook gain, 

H: impulse response convolution matrix for synthetic filter, 

C: random codevector, 

i: channel number (ii = 1-N), 

j: dispersion pattern number (j « 1 -M ) 

ci: dispersion vector for channel i, 

wij: dispersion patterns for channels i-th, j-th kinds, 

di: pulse vector for channel i. and 

L: excitation vector length (n = 0-L-1 ). 



[0068] The above embodiment explained the case in which the dispersion patterns obtained by pre-training were 
stored M by M for each channel in the dispersion pattern storing and selecting section such that the value of cost func- 
tion expression (7) becomes smaller. However, in actual, all M dispersion patterns do not have to be obtained by train- 
25 ing. If at least one kind of dispersion pattern obtained by training is stored, it is possible to obtain the functions and 
effects to improve the quality of the synthesized speech. 

[0069] Also, the above embodiment explained that case in which from all combinations of dispersion patterns stored 
in the dispersion pattern storing and selecting section stores and all combinations of pulse vector position candidates 
generated by the pulse vector generator, the combination index that maximized the reference value of expression (4) 
30 was specified by the closed bop. However, the similar functions and effects can be obtained by carrying out a pre-selec- 
tion based on other parameters (ideal gain for adaptive codevector, etc.) obtained before specifying the index of the ran- 
dom codebook or by a open loop search. 

[0070] Moreover, a speech signal communication system or a speech signal recording system having the above the 
speech coder/decoder is structured, thereby obtaining the functions and effects which the excitation vector generator 
35 described in the first embodiment has. 

(Third embodiment) 

[0071] FIG. 6 is a functional block of a CELP speech coder according to the third embodiment According to this 
40 embodiment, in the CELP speech coder using the excitation vector generator of the first embodiment in the random 
codebook, a pre-selection for dispersion patterns stored in the dispersion pattern storing and selecting section is car- 
ried out using the value of an ideal adaptive codebook gain obtained before searching the index of random codebook. 
The other portions of the random codebook peripherals are the same as those of the CELP speech coder of FIG. 4. 
Therefore, this embodiment will explain the vector quantization processing for random excitation in the CELP speech 
45 coder of FIG. 6. 

[0072] This CELP speech coder comprises an adaptive codebook 407, an adaptive codebook gain weighting section 
409, a random codebook 408 constituted by the excitation vector generator explained in the first embodiment, a random 
codebook gain weighting section 410. a synthetic filter 405. a distortion calculator 406, an indices specifying section 
413, a dispersion pattern storing and selecting section 415. a pulse vector generator 41 6, a pulse vector dispersion sec- 
50 tion 41 7, a dispersed vector adding section 41 8, and a distortion power juding section 41 9. 

[0073] In this case, according to the above embodiment, suppose that at least one of M (M = s2) kinds of dispersion 
patterns stored in the dispersion pattern storing and selecting section 41 5 is the dispersion pattern that is obtained from 
the result by performing a pre-training to reduce quantization distortion generated in vector quantization processing for 
random excitation 

55 [0074] In this embodiment, for simplifying the explanation, tt is assumed that the number N of channels of the pulse 
vector generator is 3, and the number M of kinds of dispersion patterns for each channel stored in the dispersion pattern 
storing and selecting section is 2. Also, suppose that one of M (M = 2) kinds of dispersion patterns is dispersion pattern 
obtained by the above-mentioned training, and other is random vector sequence (hereinafter referred to as random pat- 
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tern) which is generated by a random vector generator. Additionally, it is known that the dispersion pattern obtained by 
the above training has a relatively short length and a pulse-like shape as in w11 of FIG. 3. 

[0075] In the CELP speech coder of FIG. 6. processing for specifying the index of the adaptive codebook before vector 
quantization of random excitation is carried out. Therefore, at the time when vector quantization processing of random 
excitation is carried out it is possible to refer to the index of the adaptive codebook and the ideal adaptive codebook 
gain (temporarily decided). In this embodiment, the ore-selection for dispersion patterns is carried out using the value 
of the ideal adaptive codebook gain. 

[0076] More specifically, first, the ideal value of the adaptive codebook gain stored in the code indices specifying sec- 
tion 41 3 just after the search for the index of adaptive codebook is output to the distortion calculator 406. The distortion 
calculator 406 outputs the adaptive codebook gain received from the code indices specifying section 413 to the adap- 
tive codebook gain judging section 419. 

[0077] The adaptive gain judging section 41 9 performs a comparison between the value of the ideal adaptive code- 
book gain received from the distortion calculator 409 and a preset threshold value. Next the adaptive codebook gain 
judging section 419 sends a control signal for a pre-selection to the dispersion pattern storing and selecting section 415 
based on the result of the comparison. The contents of the control signal will be explained as follows. 
[0078] More specifically, when the adaptive codebook gain is larger than the threshold value as a result of the com- 
parison, the control signal provides an instruction to select the dispersion pattern obtained by the pre-training to reduce 
the quantization distortion in vector quantization processing for random excitations. Also, when the adaptive code gain 
is not larger than the threshold value as a result of the comparison, the control signal provides an instruction to carry 
out the pre-selection for the dispersion pattern different from the dispersion pattern obtained from the resuH of the pre- 
training. 

[0079] As a consequence, in the dispersion pattern storing and selecting selection 415, the dispersion pattern of M 
(M = 2) kinds, which the respective channels store, can be pre-selected in accordance with the value of the ideal adap- 
tive codebook gain, so that the number of combinations of dispersion patterns can be largely reduced. This eliminates 
the need of the distortion calculation for all the combinations of the dispersion patterns, and makes it possible to effi- 
ciently perform the vector quantization processing for random excitation with a small amount of calculations. 
[0080] Moreover, the random codevector is pulse-like shaped when the value of the adaptive gain is large (this seg- 
ment is determined as voiced) and is randomly shaped when the value of the adaptive gain is small (this segment is 
determined as unvoiced). Therefore, since the random code vector having a suitable shape for each of the voice seg- 
ment the speech signal and the non-voice segment can be used, the quality of the synthtic speech can be improved. 
[0081 ] Due to the simplification of the explanation, this embodiment explained limitedly the case in which the number 
N of channels of the pulse vector generator was 3 and the number M of kinds of the dispersion patterns was 2 per chan- 
nel stored in the dispersion pattern storing and selecting section. However, similar effects and functions can be obtained 
in a case in which the number of channels of the pulse vector generator and the number of kinds of the dispersion pat- 
terns per channel stored in the dispersion pattern storing and selecting section are different from the aforementioned 
case. 



[0082] Also, due to the simplification of the explanation, the above entxxJiment explained the case in which one of M 
kinds (M = 2) of dispersion patterns stored in each channel was dispersion patterns obtained by the above training and 
the other was random patterns. However, if at least one kind of dispersion pattern obtained by the training is stored for 
each channel, the similar effects and functions can be expected instead of the above-explained case. 
[0083] Moreover, this embodiment explained the case in which large and small information of the adaptive codebook 
gain was used in means for performing pre-selection of the dispersion patterns. However, if other parameters showing 
a short-time character of the input speech are used in addition to large and small information of the adaptive codebook 
gain, the similar effects and functions can be further expected. 

[0084] Further, a speech signal communication system or a speech signal recording system having the above the 
speech coder/decoder is structured, thereby obtaining the functions and effects which the excitation vector generator 
described in the first embodiment has. 

[0085] In the explanation of the above embodiment there was explained the method in which the pre-selection of the 
dispersion pattern was carried out using the ideal adaptive codebook gain of the current frame at the time when vector 
quantization processing of random excitation was performed. However, the similar structure can be employed even in 
a case in which a decoded adaptive codebook gain obtained in the previous frame is used instead of the ideal adaptive 
codebook gain in the current frame. In this case, the similar effects can be also obtained. 

(Fourth embodiment) 

[0086] FIG. 7 is a functional block diagram of a CELP speech coder according to the fourth embodiment In this 
embodiment, in the CELP speech coder using the excitation vector generator of the first embodiment in the random 
codebook, a pre-selection for a plurality of dispersion patterns stored in the dispersion pattern storing and selecting 
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section is carried out using available information at the time of vector quantization processing for random excitations. It 
is characterized that a value of a coding distortion (expressed by an S/N ratio), that is generated in specifying the index 
of the adaptive codebook. is used as a reference of the pre-selection. 

[0087] Note that the other portions of the random codebook peripherals are the same as those of the CELP speech 
5 coder of FIG. 4. Therefore, this embodiment will specifically explain the vector quantization processing for random exci- 
tation. 

[0088] As shown in FIG. 7, this CELP speech coder comprises an adaptive codebook 507, an adaptive codebook gain 
weighting section 509, a random codebook 508 constituted by the excitation vector generator explained in the first 
embodiment, a random codebook gain weighting section 510. a synthetic fitter 505, a distortion calculator 506, a code 
w indices specifying section 513, a dispersion pattern storing and selecting section 515, a pulse vector generator 516, a 
pulse vector dispersion section 51 7, a dispersed vector adding section 518, and a coding distortion judging section 519. 
[0089] In this case, according to the above embodiment, suppose that at least one of M (M = s2) kinds of dispersion 
patterns stored in the dispersion pattern storing and selecting section 51 5 is the random pattern. 
[0090] In the above embodiment, for simplifying the explanation, the number N of channels of the pulse vector gen- 
ts erator is 3 and the number M of kinds of the dispersion patterns is 2 per channel stored in the dispersion pattern storing 
and selecting section. Moreover, one of M (M = 2) kinds of dispersion patterns is the random pattern, and the other is 
the dispersion pattern that is obtained as the result of pre- training to reduce quantization distortion generated in vector 
quantization processing for random excitations. 

[0091 ] In the CELP speech coder of FIG. 7, processing for specifying the index of the adaptive codebook is performed 

20 before vector quantization processing for random excitation. Therefore, at the time when vector quantization processing 
of random excitation is carried out it is possible to refer to the index of the adaptive codebooK the ideal adaptive code- 
book gain (temporarily decided), and the target vector for searching the adaptive codebook. In this embodiment, the 
pre-selection for dispersion patterns is carried out using the coding distortion (expressed by S/N ratio) of the adaptive 
codebook which can be calculated from the above three information. 

25 [0092] More specifically, the index of adaptive codebook and the vaJue of the adaptive codebook gain (ideal gain) 
stored in the code indices specifying section 513 just after the search for the adaptive codebook is output to the distor- 
tion calculator 506. The distortion calculator 506 calculates the coding distortion (S/N ratio) generated by specifying the 
index of the adaptive codebook using the index of adaptive codebook received from the code indices specifying section 
513, the adaptive codebook gain, and the target vector for searching the adaptive codebook. Then, the distortion cal- 

30 culator 506 outputs the calculated S/N value to the coding distortion juding section 51 9. 

[0093] The coding distortion juding section 519 performs a comparison between the S/N value received from the dis- 
tortion calculator 506 and a preset threshold value. Next the coding distortion juding section 519 sends a control signal 
for a pre-selection to the dispersion pattern storing and selecting section 515 based on the result of the comparison. 
The contents of the control signal will be explained as follows. 

35 [0094] More specifically, when the S/N value is larger than the threshold value as a result of the comparison, the con- 
trol signal provides an instruction to select the dispersion pattern obtained by the pre-training to reduce the quantization 
distortion generated by coding the target vector for searching the random codebook. Also, when the S/N value is 
smaller than the threshold value as a result of the comparison, the control signal provides an instruction to select the 
non-pulse-like random patterns. 

40 [0095] As a consequence, in the dispersion pattern storing and selecting selection 515, only one kind is p re-selected 
from M (M = 2) kinds of dispersion patterns, which the respective channels store, so that the number of combinations 
of dispersion patterns can be largely reduced. This eliminates the need of the distortion calculation for all the combina- 
tions of the dispersion patterns, and makes it possible to efficiently specify the index of the random codebook with a 
small amount of calculations. 

45 [0096] Moreover, the random codevector is pulse-like shaped when the S/N value is large, and is non-pulse-like 
shaped when the S/N value is small. Therefore, since the shape of the random codevector can be changed in accord- 
ance with the short-time characteristic of the speech signal, the quality of the synthetic speech can be improved. 
[0097] Due to the simplification of the explanation, this embodiment explained limrtediy the case in which the number 
N of channels of the pulse vector generator was 3 and the number M of kinds of the dispersion patterns was 2 per chan- 

so nel stored in the dispersion pattern storing and selecting section. However, similar effects and functions can be obtained 
in a case in which the number of channels of the pulse vector generator and the number of kinds of the dispersion pat- 
terns per channel stored in the dispersion pattern storing and selecting section are different from the aforementioned 
case. 

[0098] Also, due to the simplification of the explanation, the above embodiment explained the case in which one of M 
55 kinds (M = 2) of dispersion patterns stored in each channel was dispersion patterns obtained by the above pre-training 
and the other was random patterns. However, if at least one kind of random dispersion pattern is stored for each chan- 
nel, the similar effects and functions can be expected instead of the above-explained case. 

[0099] Moreover, this embodiment explained the case in which only large and small information of coding distortion 
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described in the first embodiment has. 
(Fifth embodiment) 

x. ^ «i ui^u m a rpi p <;neech coder according to the fifth embodiment of the present 
10101] RG. *J^£gZ*»* ^Mn^S^ng^tion'eOO performs a se«-corre.ation analysis 

shown by the following expression (8). 
55 X(0 = {S f (/>p(dCO+S f+1 (i)fiWl+P) (8) 



where Xp): target vector, 
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i: vector element number, 
St(D. St +1 (i): input vector, 
t: time (frame number), 
p: weighting coefficient (fixed), and 
5 d(i): decoded vector of previous frame. 

[0113] The following will show a concept of the above target extraction, method. In a typical vector quantization, 
parameter vector S t (i) is used as target X(i) and a matching is performed by the following expression (9): 

w t 

En = £(X(/>Cn(/)) 2 (9) 
/-o 



15 where En: distance from n-th code vector, 

X(i): target vector, 

Cn(i): code vector, 

n: code vector number, 

i: order of vector, and 
20 I: length of vector. 

[01 1 4] Therefore, in the conventional vector quantization, the coding distortion directly leads to degradation in speech 
quality. This was a big problem in the ultra-low bit rate coding in which the coding distortion cannot be avoided to some 
extent even if measurements such as prediction vector quantization is taken. 
25 [01 1 5] For this reason, according to this embodiment, attention should be paid to a middle point of the decoded vector 
as a direction where the user does not perceptually feel an error easily, and the decoded vector is induced to the middle 
point so as to realize perceptual improvement In the above case, there is used a characteristic in which time continuity 
is not easily heard as a perceptual degradation. 

[01 1 6] The following will explain the above state with reference to FIG. 1 0 showing a vector space. 

30 [01 17] First of all, it is assumed that the decoded vector of one previous frame is d(i) and a future parameter vector 
is St +1 (i) (although a future coded vector is actually desirable, the future parameter vector is used for the future coded 
vector since the coding cannot be carried out in the current frame. In this case, although the code vector Cn(i): (1) is 
closer to the parameter vector St(i) than the code vector Cn(i): (2), the code vector Cn(i): (2) is actually close onto a line 
connecting d(i) and S, +1 (i). For this reason, degradation is not easily heard as compared with (1). Therefore, by use of 

35 the above characteristic, if the target X(i) is set as a vector placed at the position where the target X(i) approaches to 
the middle point between d(i) and St +1 (i) from St(i) to some degree, the decoded vector is induced to a direction where 
the amount of distortion is perceptually slight. 

[01 18] Then, according to this embodiment, the movement of the target can be realized by introducing the following 
evaluation expression (10) 

40 

X{i) = (S f (/) + p(d{i) + S f+1 (/)/2}/(1 + p) (10) 

where X(i): target vector, 
i: vector element number, 
45 S^i). St +1 (i): input vector, 
t: time (frame number), 
p: weighting coefficient (fixed), and 
d(i): decoded vector of previous frame. 

so [01 19] The first half of expression (10) is a general evaluation expression, and the second half is a perceptual com- 
ponent. In order to carry out the quantization by the above evaluation expression, the evaluation expression is differen- 
tiated with respect to each X(i) and the differentiated result is set to 0, so that expression (8) can be obtained. 
[0120] Note that the weighting coefficient p is a positive constant. Specifically, when the weighting coefficient p is zero, 
the result is similar to the general quantization when the weighting coefficient p is infinite, the target is placed at the 

55 completely middle point If the weighting coefficient p is too large, the target is largely separated from the parameter 
S t (i) of the cunertt frame so that articulation is perceptually reduced. The test listening of decoded speech confirms that 
a good performance with 0.5<p<1 .0 can be obtained. 

[0121] Next, in the quantizing section 703, the quantization target obtained by the target extracting section 702 is 
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quantized so as to obtain a vector code and a decoded vector, and the obtained vector index and decoded vector are 
sent to the distortion calculator 704. 

[0122] Note that a predictive vector quantization is used as a quantization method in this embodiment The following 

will explain the predictive vector quantization. 
5 [0123] FIG. 1 1 shows a functional block of the predictive vector quantization. The predictive vector quantization is an 

algorithm in which the prediction is earned out using the vector (synthesized vector) obtained by coding is quantized. 

[0124] A vector codebook 800. which stores a plurality of main samples (codevectors) of the prediction error vectors. 

is prepared in advance. "This is prepared by an LBG algorithm (IEEE TRANSACTIONS ON COMMUNICATIONS. VOL 

COM-28, NO. 1, PP84-95, JANUARY 1980) based on a large number of vectors obtained by analyzing a large amount 
10 of speech data. 

[0125] A vector 801 for quantization target is predicted by a prediction section 802. The prediction is carried out by 
the post-decoded vectors stored in a state storing section 803. and the obtained predictive error vector is sent to a dis- 
tance calculator 804. Here, as a form of prediction, a first prediction order and a fixed coefficient are used. Then, an 
expression for calculating the predictive error vector in the case of using the above prediction is shown by the following 
is expression (11). 

Y(i) = X(i)-pD(i) (11) 

where Y(i): predictive error vector, 
20 X(i): target vectpr, 

p: prediction coefficient (scalar) 

D(i): decoded vector of one previous frame, and 

i: vector order. 

25 [01 26] In the above expression, it is general that the prediction coefficient p is a value of 0 < p < 1 . 

[01 27] Next, the distance calculator 804 calculates the distance between the predictive error vector obtained by the 
prediction section 802 and the codevector stored in codebook 800. An expression for obtaining the above distance is 
shown by the following expression (12): 

30 I 

E/?-£(T(/)- Cn(i)) 2 (12) 



35 where En: distance from n-th code vector, 

Y(i): predictive error vector, 

Cnfi): codevector, 

n: codervector number, 

I: vector order, and 
40 I: vector length. 

[0128] Next, in a searching section 805, the distances for respective codevectors are compared, and the index of 
codevector which gives the shortest distance is output as a vector code 806. 

[0129] In other words, the vector codebook 800 and the distance calculator 804 are controlled so as to obtain the 
45 index of codevector which gives the shortest distance from all codevectors stored in the vector codebook 800. and the 
obtained index is used as vector code 806. 

[0130] Moreover, the vector is coded using the code vector obtained from the vector codebook 800 and the past- 
decoded vector stored in the state storing section 803 based on the final coding, and the content of the state storing 
section 803 is updated using the obtained synthesized vector. Therefore, the decoded vector here is used in the predic- 
so tion when a next quantization is performed. 

[0131] The decoding of the example (first prediction order, fixed coefficient) in the above-mentioned prediction form 
is performed by the following expression (13): 

Zffl-CNfi) + pDTi) (13) 

55 

where Z(i): decoded vector (used as D(i) at a next coding time, 
N:code for vector, 
CN(i): code vector. 
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p: prediction coefficient (scalar), 

D(i): decoded vector of one previous frame, and 

i: vector order. 

5 [0132] On the other hand, in a decoder, the code vector is obtained based on the code of the transmitted vector so 
as to be decoded. In the decoder, the same vector codebook and state storing section as those of the coder are pre- 
pared in advance. Then, the decoding is carried out by the same algorithm as the decoding function of the searching 
section in the aforementioned coding algorithm. The above is the vector quantization, which is executed in the quantiz- 
ing section 703. 

10 [01 33] Next, the distortion calculator 704 calculates a perceptual weighted coding distortion from the decoded vector 
obtained by the quantizing section 703, the input vector 701 , and the decoded vector of the previous frame stored in the 
decoded vector storing section 707. An expression for calculation is shown by the following expression (1 4): 

Ew = £( V(i)-S,(/)) 2 +p{V(i)-{d{t)+S t ^[i)f2} 2 (14) 

15 

where Ew: weighted coding distortion, 
St(D. St +1 ('): input vector, 
t: time (frame number) 
20 i: vector element number, 
V(i): decoded vector, 
p: weighting coefficient (fixed), and 
d(t): decoded vector of previous frame. 

25 [01 34] In expression (1 4), the weighting efficient p is the same as the coefficient of the expression of the target used 
in the target extracting section 702. "Then, the value of the weighted coding distortion, the encoded vector and the code 
of the vector are sent to the comparator 705. 

[0135] The comparator 705 sends the code of the vector sent from the distortion calculator 704 to the transmission 
path 608. and further updates the content of the decoded vector storing section 707 using the vector sent from the dis- 
30 tortion calculator 704. 

[0136] According to the above-mentioned embodiment, in the target extracting section 702, the target vector is cor- 
rected from S t (i) to the vector placed at the position approaching to the middle point between D(i) and St +1 (i) to same 
extent. This makes it possible to perform the weighted search so as not to arise perceptual degradation. 
[0137] The above explained the case in which the present invention was applied to the low bit rate speech coding 
35 technique used in such as a cellular phone. However, the present invention can be employed in not only the speech 
coding but also the vector quantization for a parameter having a relatively good interpolation in a music coder and an 
image coder. 

[0138] In general, the LPC coding executed by the LPC analyzing section in the above-mentioned algorithm, conver- 
sion to parameters vector such as LPS (Line Spectram Pairs), which are easily coded, is commonly performed, and 

40 vector quantization (VQ) is carried out by Euclidean distance or weighted Euclidean distance. 

[0139] Also, according to the above embodiment, the target extracting section 702 sends the input vector 701 to the 
vector smoothing section 708 after being subjected to the control of the comparator 705. Then, the target extracting 
section 702 receives the input vector changed by the vector smoothing section 708, thereby re-extracting the target. 
[0140] In this case, the comparator 705 compares the value of weighted coding distortion sent from the distortion cal- 

45 culator 704 with a reference value prepared in the comparator. Processing is divided into two. depending on the com- 
parison result 

[0141] H the comparison result is under the reference value, the comparator 705 sends the index of the codevector 
sent from the distortion calculator to the transmission path 608, and updates the content of the decoded vector storing 
section 707 using the coded vector sent from the distortion calculator 704. This update is carried out by rewriting the 
so content of the decoded vector storing section 707 using the obtained coded vector. Then, processing moves to one for 
a next frame parameter coding. 

[0142] While, if the comparison result is more than the reference value, the comparator 705 controls the vector 
smoothing section 708 and adds a change to the input vector so that the target extracting section 702, the quantizing 
section 703 and distortion calculator 704 are functioned again to perform coding again. 
55 [0143] In the comparator 705. coding processing is repeated until the comparison result reaches the value under ref- 
erence value. However, there is a case in which the comparison result can not reache the value under the reference 
value even if coring processing is repeated many times. In case, the comparator 705 provides a counter in its interior, 
and the counter counts the number of times wherein the comparison result is determined as being more than the refer- 
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ence value. When the number of times is more than a fixed number of times, the comparator 705 stops the repetition of 
coding and clears the comparison result and counter state, then adopts initial index. 

[0144] The vector smoothing section 708 is subjected to the control of the comparator 705 and changes parameter 
vector St(i) of the current frame, which is one of input vectors, from the input vector obtained by the target extracting 
section 702 and the decoded vector of the previous frame obtained decoded vector storing section 707 by the following 
expression (15), and sends the changed input vector to the target extracting section 702. 

S t (i) <-(1-q) • S t (i) + q(d(0 + S t+1 (0V2 (15) 

[0145] In the above expression, q is a smoothing coefficient, which shows the degree of which the parameter vector 
of the current frame is updated close to a middle point between the decoded vector of the previous frame and the 
parameter vector erf the future frame. The coding experiment shows that good performance can be obtained when the 
upper limitation of the number of repetition executed by the interior of the comparator 705 is 5 to 8 under the condition 
ofO. 2<q<0.4. 

[0146] Although the above embodiment uses the predictive vector quantization in the quantizing section 703, there is 
a high possibility that the weighted coding distortion obtained by the distortion calculator 704 will become small. This is 
because the quantized target is updated closer to the decoded vector of the previous frame by smoothing. Therefore, 
by the repetition of decoding the previous frame due to the control of the comparator 705, the possibility that the com- 
parison result will become under the reference value is increased in the distortion comparison of the comparator 705. 
[01 47] Also, in the decoder, there is prepared a decoding section corresponding to the quantizing section of the coder 
in advance such that decoding is carried out based on the index of the codevector transmitted through the transmission 
path. 

[01 48] Also, the embodiment of the present invention was applied to quantization (quantizing section is prediction VQ) 
of LSP parameter appearing CELP speech coder, and speech coding and decoding experiment was performed. As a 
result, it was confirmed that not only the subjective quality but also the objective value (S/N value) could be improved. 
This is because there is an effect in which the coding distortion of predictive VQ can be suppressed by coding repetition 
processing having vector smoothing even when the spectrum drastically changes. Since the future prediction VQ was 
predicted from the past-decoded vectors, there was a disadvantage in which the spectral distortion of the portion where 
the spectrum drastically changes such as a speech onset contrarily increased. However, in the application of the 
embodiment of the present invention, since smoothing is carried out until the Distortion lessens in the case where the 
distortion is large, the coding distortion becomes small though the target is more or less separated from the actual 
parameter vector. Whereby, there can be obtained an effect in which degradation caused when decoding the speech is 
totally reduced. Therefore, according to the embodiment of the present invention, not only the subjuctive quality but also 
the objective value can be improved. 

[01 49] In the above-mentioned embodiment of the present invention, by the characteristics of the comparator and the 
vector smoothing section, control can be provided to the direction where the operator does not perceptually feel the 
direction of degradation in the case where the vector quantizing distortion is large. Also, in the case where predictive 
vector quantization is used in the quantizing section, smoothing and coding are repeated until the coding distortion less- 
ens, thereby the objective value can be also improved. 

[01 50] The above explained the case in which the present invention was applied to the low bit rate speech coding 
technique used in such as a cellular phone. However, the present invention can be employed in not only the speech 
coding but also the vector quantization for a parameter having a relatively good interpolation in a music coder and an 
image coder. 

(Sixth embodiment) 

[0151] Next, the following will explain the CELP speech coder according to the sixth embodiment. The configuration 
of this embodiment is the same as that of the fifth embodiment excepting quantization algorithm of the quantizing sec- 
tion using a multi-stage predictive vector quantization as a quantizing method. In other words, the excitation vector gen- 
erator of the first embodiment is used as a random codebook. Here, the quantization algorithm of the quantizing section 
will be specifically explained. 

[01 52] FIG. 1 2 shows the functional block of the quantizing section. In the multi-stage predictive vector quantization, 
the vector quantization of the target is carried out thereafter the vector is decoded using a codebook with the index of 
the quantized target a difference between the coded vector. Then, the original target (hereinafter referred to as coded 
distortion vector) is obtained, and the obtained coded distortion vector is further vector-quantized. 
[01 53] A vector codebook 899 in which a plurality of dominant samples (codevectors) of the predictive error vector are 
stored and a codebook 900 are generated in advance. These codevectors are generated by applying the same algo- 
rithm as that of the codevector generating method of the typical "multi-vector quantization". In other words, these code- 
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vectors are generally generated by an LBG algorithm (IEEE TRANSACTIONS ON COMMUNICATIONS, VOL COM- 
28, NO. 1. PP84-95, JANUARY 1980) based on a large number of vectors obtained by analyzing many speech data. 
Note that, a training date for designing codevectors 899 is a set of many target vectors, while a training date for design- 
ing codebook 900 is a set of coded distortion vectors obtained when the above-quantized targets are coded by the vec- 
5 tor codebook 899. 

[0154] First, a vector 901 of the target vector is predicted by a predicting section 902. The prediction is earned out by 
the past-decoded vectors stored in a state storing section 903, and the obtained predictive error vector is sent to dis- 
tance calculators 904 and 905. 

[01 55] According to the above embodiment, as a form of prediction, a fixed coefficient is used for a first order predic- 
w tion. Then, an expression for calculating the predictive error vector in the case of using the above prediction is shown 
by the following expression (16). 

Y(i)=X(i)-pD(i) (16) 

is where Y(i): predictive error vector, 
X(i): target vector, 
p; predictive coefficient (scalar), 
D(i): decoded vector of one previous frame, and 
i: vector order. 

20 

[0156] In the above expression, it is general that the predictive coefficient p is a value of 0 < p < 1 . 
[0157] Next, the distance calculator 904 calculates the distance between the predictive error vector obtained by the 
prediction section 902 and code vector A stored in the vector codebook 899. An expression for obtaining the above dis- 
tance is shown by the following expression (17): 

25 

t 

En=£(X(0- C1n(0) 2 (17) 
/-o 

30 

where En: distance from n-th code vector A 
Y(i): predictive error vector, 
C 1 n(i) : codevector A, 
n: index of codervector A, 
35 l: vector order, and 
I: vector length. 

[01 58] Then, in a searching section 906, the respective distances from the codevector A are compared, and the index 
of the code vector A having the shortest distance is used as a code for code vector A. In other words, the vector code- 

40 book 899 and the distance calculator 904 are controlled so as to obtain the code of codevector A having the shortest 
distance from all codevectors stored in the codebook 899. Then, the obtained code of codevector A is used as the index 
of codebook 899. After this, the code for codevector A and decoded vector A obtained from the codebook 899 with ref- 
erence to the code for codevector A are sent to the distance calculator 905. Also, the code for codevector A is sent to a 
searching section 906 through the transmission path. 

45 [01 59] The distance calculator 905 obtains a coded distortion vector from the predictive error vector and the decoded 
vector A obtained from the searching section 906. Also, the distance calculator 905 obtains amplitude from an amplifier 
storing section 908 with reference to the code for codevector A obtained from the searching section 906. Then, the dis- 
tance calculator 905 calculates a distance by multiplying the above coded distortion vector and codevector B stored in 
the vector codebook 900 by the above amplitude, and sends the obtained distance to the searching section 907. An 

so expression for the above distance is shown as follows: 

Z(i) = Y(i)-C1N(i) 

/ 

55 Em = £(Z(/)-a/VC2m(/)) 2 (18) 
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where Z(i): decoded vector, 

Y(i): predictive error vector. 

CI N(i): decoded vector A, 

Em : distance from m-th code vector B, 

aN : amplitude corresponding to the code for codevector A, 

C2m(i): codevector B, 

m : index of codevector B, 

i : vector order, and 

I : vector length 

[01 60] Then, in a searching section 907, the respective distances from the codevector B are compared, and the index 
of the codevector B having the shortest distance is used as a code for codevector B. In other words, the codebook 900 
and the distance calculator 905 are controlled so as to obtain the code of codevector B having the shortest distance 
from all codevectors stored in the vector codebook 900. Then, the obtained code of codevector B is used as the index 
of codebook 900. After this, codevector A and codevector B are added and used as a vector code 909. 
[0161] Moreover, the searching section 907 carries out the decoding of the vector using decoded vectors A, B 
obtained from the vector codebooks 899 and 900 based on the codes for codevector A and codevector B, amplitude 
obtained from an amplifier storing section 908 and past decoded vectors stored in the state storing section 903. The 
content of the state storing section 903 is updated using the obtained decoded vector. (Therefore, the vector as 
decoded above is used in the prediction at a next coding time). The decoding in the prediction (a first prediction order 
and a fixed coefficient) in this embodiment is performed by the following expression (19): 

Z(i) = C1 N(i) + aN - C2M(i) + pD(i) (19) 

where Z(i): decoded vector (used as D(i) at the next coding time), 

N: code for codevector A, 

M: code for codevector B, 

C1 N(i): decoded codevector A, 

C2M(i): decoded codevector B, 

aN : amplitude corresponding to the code for codevector A, 

p: predictive coefficient (scalar), 

D(i): decoded vector of one previous frame, and 

i: vector order. 

[0162] Also, although amplitude stored in the amplifier storing section 908 is preset the setting method is set forth 
below. The amplitude is set by coding much speech data is coded, obtaining the sum of the coded distortions of the 
following expression (20), and performing the training such that the obtained sum is minimized. 

/ 

EN = ££( V f (/>C1 N(i)-aNC2m t {i)) 2 A (20) 

i=0 



where EN: coded distortion when the code for codevector A is N, 

N: code for codevector A, 

t: time when the code for codevector A is N, 

Yt(l): predictive error vector at time t 

C1 N(i) : decoded codevector A, 

aN: amplitude corresponding to the code for codevector A. 
C2mt(i): codevector B, 
i: vector order, and 
I: vector length. 

[01 63] In other words, after coding, amplitude is reset such that the value, which has been obtained by differentiating 
the distortion of the above expression (20) with respect to each amplitude, becomes zero, thereby performing the train- 
ing of amplitude. Then, by the repetition of coding and training, the suitable value of each amplitude is obtained. 
[01 64] On the other hand, the decoder performs the decoding by obtaining the codevector based on the code of the 
vector transmitted. The decoder comprises the same vector codebooks (corresponding to codebooks A, B) as those of 



18 



EP 0 967 594 A1 



the coder, the amplifier storing section, and the state storing section. Then, the decoder carries out the decoding by the 
same algorithm as the decoding function of the searching section (corresponding to the codevector B) in the aforemen- 
tioned coding algorithm. 

[0165] Therefore, according to the above-mentioned embodiment, by the characteristics of the amplifier storing sec- 
5 tion and the distance calculator, the code vector of the second stage is applied to that of the first stage with a relatively 
small amount of calculations, thereby the coded distortion can be reduced. 

[01 66] The above explained the case in which the present invention was applied to the low bit rate speed coding tech- 
nique used in such as a cellular phone. However, the present invention can be employed in not only the speech coding 
but also the vector quantization for a parameter having a relatively good interpolation in a music coder and an image 
10 coder 



(Seventh embodiment) 

[0167] Next, the following will explain the CELP speech coder according to the sixth embodiment. This embodiment 
15 shows an example of a coder, which is capable of reducing the number of calculation steps for vector quantization 
processing for ACE LP type random codebook. 

[0168] FIG. 13 shows the functional block of the CELP speech coder according to this embodiment. In this CELP 
speech coder, a fitter coefficient analysis section 1002 provides the linear predictive analysis to input speech signal 
1001 so as to obtain coefficients of the synthesis fitter, and outputs the obtained coefficients of the synthesis filter to a 
20 filter coefficient quantization section 1003. The fitter coefficient quantization section 1003 quantizes the input coeffi- 
cients of the synthesis filter and outputs the quantized coefficients to a synthesis filter 1 004. 

[01 69] The synthesis filter 1 004 is constituted by the filter coefficients supplied from the fitter coefficient quantization 
section 1003. The synthesis filter 1004 is excited by an excitation signal 101 1. The excitation signal 101 1 is obtained by 
adding a signal, which is obtained by multiplying an adaptive codevector 1006, i.e., an output from an adaptive code- 
cs book 1005, by an adaptive codebook gain 1007, and a signal, which is obtained by multiplying a random codevector 
1009, i.e., an output from a random codebook 1008, by a random codebook gain 1010. 

[0170] Here, the adaptive codebook 1005 is one that stores a plurality of adaptive codevectors, which extracts the 
past excitation signal for exciting the synthesis filter every pitch cycle. The random codebook 1 007 is one that stores a 
plurality of random codevectors. The random codebook 1007 can use the excitation vector generator of the aforemen- 
30 tioned first embodiment 

[0171 ] A distortion calculator 1013 calculates a distortion between a synthetic speech signal 1012, i.e., the output of 
the synthesis filter 1 004 excited by the excitation signal 101 1 , and the input speech signal 1001 so as to carry out code 
search processing. The code search processing is one that specifies the index of the adaptive codevector 1006 for min- 
imizing the distortion calculated by the distortion calculator 1013 and that of the random gain 1009. At the same time, 
35 the code search processing is one that calculates optimum values of the adaptive codebook gain 1007 and the random 
codebook gain 1010 by which the respective output vectors are multiplied. 

[0172] A code output section 1014 outputs the quantized value of the filter coefficients obtainable from the filter coef- 
ficient quantization section 1003, the index of the adaptive codevector 1006 selected by the distortion calculator 1013 
and that of the random codevector 1009. and the quantized values of adaptive codebook gain 1007 and random code- 
40 book gain 1009 by which the respective output vectors are multiplied. The outputs from the code output section 1014 
are transmitted or stored. 

[0173] In the code search processing in the distortion calculator 1013, an adaptive codebook component of the exci- 
tation signal is first searched, and a codebook component of the excitation signal is next searched. 
[0174] The above search of the random codebook component uses an orthogonal search set forth below. 
45 [0175] The orthogonal search specifies a random vector c. which maximizes a search reference value Eort (= 
Nort/Dort) of expression (21). 

Cor(- Nort \ [iiP t ^Hc)x-(x t Hp)Hp)Hc} 2 

y °° r V (c'H'HcHp'H'Hpyip'H'Hc) 2 K ' 



where Nort: numerator term for Eort, 
Dort: denominator term for Eort. 
p: adaptive codevector already specified. 
55 H: synthesis filter coefficient matrix, 
H l : transposed matrix for H. 

X: target signal (one that is obtained by differentiating a zero input response of the synthesis filter from the input 
speech signal), and 
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c: random codevector. 

[0176] The orthogonal search is a search method for orthogonalizing random codevectors serving as candidates with 
respect to the adaptive vector specified in advance so as to specify index that minimizes the distortion from the plurality 
of orthogonalized random codevectors. The orthogonal search has the characteristics in which a accuracy for the ran- 
dom codebook search can be improved as compared with a non-orthogonal search and the quality of the synthetic 
speech can be improved. 

[0177] In the ACE LP type speech coder, the random codevector is constituted by a few signed pulses. By use of the 
above characteristic, the numerator term (Nort) of the search reference value shown in expression (21) is deformed to 
the following expression (22) so as to reduce the number of calculation steps on the numerator term. 

A/otf = {a 0 iK/ 0 )+a 1 H/(/ 1 )+- • •+a„. 1 v(^.i)} 2 (22) 



where a^: sign of i-th pulse (+1/-1). 
Ij: position of i-th pulse, 
N: number of pulses, and 
(MtP^H^xVHpJHpJH. 



[0178] rf the vaJue of <t> of expression (22) is calculated in advance as a pre-processing and expanded to an array, (N- 
1 ) elements out of array <f> are added or substituted, and the resultant is squared, whereby the numerator term of expres- 
sion (21) can be calculated. 

[0179] Next, the following will specifically explain the distortion calculator 1013, which is capable of reducing the 
number of calculation steps on the denominator term. 

[01 80] FIG. 1 4 shows the functional block of the distortion calculator 1013. The speech coder of this embodiment has 
the configuration in which the adaptive codevector 1006 and the random codevector 1009 in the configuration of FIG. 
13 are input to the distortion calculator 1013. 

[0181] in FIG. 14, the following three processing is carried out as pre-processing at the time of calculating the distor- 
tion for each random codevector. 



(1) Calculation of first matrix (N): power of synthesized adaptive codevector (pWHp) and self-correlation matrix of 
synthesis fitter's coefficients (H l H) are computed, and each element of the self-correlation matrix are multiplied by 
the above power so as to calculate matrix N (=(p t H l Hp)H t H) . 

(2) Calculate second matrix (M): time reverse synthesis is performed to the synthesized adaptive codevector for 
producing (pWH) ,and outer products of the above resultant signal (p'H'H) is calculated for producing matrix H. 

(3) Generate third matrix (L): matrix M calculated in item (2) is subtracted from matrix N calculated in item (1) so as 
to generate matrix L. 



[0182] Also, the denominator term (Dort) of expression (21) can be expanded as in the following expressions (23). 

Dort=: (c\H*Hc)(p VHp)-(pVHc)^ (23) 



= c 


'Nc-(rc) 2 


= c 


'Nc-lrcj'tr'c) 


= c 


WfcVc) 


= c 


Wfc'Mc) 


= c 


'(N-MJc 


= c 





where Nifp'H'HpJrfH the above pre-processing (1), 
r: p ! H l H the above pre-processing (2), 
M; rr 1 the above pre-processing (2), 
L: N-M the above pre-processing (3), 
C: random codevector 



[0183] Thereby, the calculation of the denominator term (Dort) at the time of the calculation of the search reference 
value (Eort) of expression (21) is replaced with expression (23), thereby making it possible to spedfy the random code- 
book component with the smaller amount of calculation. 
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[01 84] The calculation of the denominator term is carried out using the matrix L obtained in the above pre-processing 
and the random codevector 1 009. 

[0185] Here, for simplifying the explanation, the calculation method of the denominator term will be explained on the 
basis of expression (23) in a case where a sampling frequency of the input speech signal is 8000 Hz, the random code- 
5 book has Algebraic structure ,and its codevectors are constructed by five signed unit pulses per 10 ms frame 

[0186] The five signed unit pulses constituting the random vector have pulses each selected from the candidate posi- 
tions defined for each of zero to fourth groups shown in Table 2, then random vector c can be described by the following 
expression (24). 

w C = a 0 6 (k-1 0 ) + a , 6 (k-1 t ) + + a 4 6 (k-1 4 ) (k=0, 1. • ■ • 79) (24) 

where a^ sign (+1/-1) of pulse belonging to group i. and 
lj: position of pulse belonging to group i. 



Table 2 



Group Number 


Code 


Pulse Candidate Position 


0 


±1 


0,10,20,30, ,60.70 


1 


±1 


2, 12, 22, 32, • • 62, 72 


2 


±1 


2,16, 26.36. • • 66.76 


3 


±1 


4.14,24,34, • • ',64. 74 


4 


±1 


8, 18, 28,38, • • 68.78 



[01 87] At this time, the denominator term (Dort) shown by expression (23) can be obtained by the following expression 
(25): 

30 4 4 

Dort = EE a ' a / L(/ ' ,/ / ) (25) 



35 where a 1 : sign (+1/-1 ) of pulse belonging to group i, 

lj: position of pulse belonging to group i. and L(lj, ip: element (I, row and lj column) of matrix L 

[0188] As explained above, in the case where the ACELP type random codebook is used, the numerator term (Nort) 
of the code search reference value of expression (21 ) can be calculated by expression (22), while the denominator term 
40 (Dort) can be calculated by expression (25). Therefore, in the use of the ACELP type random codebook, the numerator 
term is calculated by expression (22) and the denominator term is calculated by expression (25). respectively, instead 
of directly calculating of the reference value of expression (21). This makes it possible to greatly reduce the number of 
calculation steps for vector quantization processing of random excitations. 

[0189] The aforementioned embodiments explained the random code search with no pre- selection. However, the 
45 same effect as mentioned above can be obtained rf the present invention is applied to a case in which pre-selection 
based on the values of expression (22) is employed, the values of expression (21) are calculated for only pre-s elected 
random codevectors with expression (22) and expression (25), then finally selecting one random codevector, which 
maximize the above search reference value. 

so Claims 

1 . An excitation vector generator for generating an excitation vector comprising: 

pulse vector generating means having N channels (N==1) for generating pulse vectors; 
55 storing means for storing M (Ms1) kinds of dispersion patterns every channel in accordance with N channels; 

selecting means for selectively taking out a dispersion pattern from said storing means every channel; 
dispersion means for generating N dispersed vectors by convolution calculation using the taken-out dispersion 
pattern and the generated pulse vectors every channel; and 
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excitation vector generating means for generating an excitation vector from N dispersed vectors generated. 

The excitation vector generator according to claim 1 , wherein said pulse vector generating means generates N 
pulse vectors algebraically. 

The excitation vector generator according to claim 1 , wherein said dispersion means generates the dispersed vec- 
tors every channel by the convolution calculation based on the following expression: 

L-1 

ci(n) » £ wij(n-k) di{k) 

K-0 



where n: 0-L-1, 

L: dispersed vector length, 

i: channel number, 

j: dispersion pattern number (j-1 ~M), 

ci: dispersed vector for channel i, 

wij: dispersion pattern for channel i,j 

di: pulse vector for channel 1 , 

di as ± 6 (n-pi) , n = 0-L-1 , and 

pi: pulse position candidate for channel i. 

The excitation vector generator according to claim 1, wherein said excitation vector generating means generates 
one excitation vector from N dispersed vectors based on the following expression: 

N 

c(n) = £c/(r>) 
/-1 



where c: excitation vector, 

ci: dispersed vector, 

i: channel number (i= 1-N), and 

n: vector element number (n = 0-L-1 : note that L is an excitation vector length). 

A CELP speech coder for coding speech signal comprising: 

the excitation vector generator described in claim 1 ; 

a random codebook used to vector-quantize random excitation information; 

a synthesis filter for generating a synthetic speech using an excitation vector output from said excitation vector 
generator as a random codevector; 

a distortion calculator for calculating a quantization distortion caused between the generated synthetic speech 
and an input speech; 

means for changing a combination of a pulse position, a pulse polarity, and a dispersion pattern constituting a 
pulse vector; and 

means for specifying the combination of the pulse position, the pulse polarity and the dispersion pattern such 
that the quantization distortion calculated by said distortion calculator is minimized so as to specify the index 
of random codebook. 

The CELP speech coder according to claim 5, wherein the dispersion pattern is stored in storing means of said 
excitation vector generator, said dispersion pattern obtaining by pre-training to lessen the quantization distortions 
caused during vector quantization processing for random excitations. 

The CELP speech coder according to claim 6, wherein the storing means of said excitation vector generator stores 
at least one kind of dispersion pattern obtained by training every channel. 

The CELP speech coder according to claim 7, wherein when a value of an ideal adaptive codebook gain, which has 
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been calculated at the time of vector quantization processing for adaptive excitation, is larger than a preset thresh- 
old value, the dispersion pattern obtained by training is selected. 

9. The CELP speech coder according to claim 7, wherein when a value of a decoded adaptive codebook gain is larger 
5 than a preset threshold value, the dispersion pattern obtained by training is selected. 

10. The CELP speech coder according to claim 5, wherein at least one kind of dispersion pattern stored in the storing 
means of said excitation vector generator in every channel is a random pattern. 

w 11. The CELP speech coder according to claim 5, wherein at least one kind of dispersion pattern stored in the storing 
means of said excitation vector generator in every channel is a dispersion pattern obtained by pre- training to lessen 
the quantization distortion caused during vector quantization processing for random excitations, and at least one 
kind thereof is a random pattern. 

is 1 2. The CELP speech coder according to claim 1 1 , wherein when a coding distortion caused when specifying the index 
of adaptive codebook is larger than a preset threshold value, a dispersion vector of the random pattern is selected. 

13. The CELP speech coder according to claim 5, wherein a combination index showing a combination of the disper- 
sion patterns selected by each channel is specif ied from all combinations M N of the dispersion patterns obtainable 

20 such that the quantization distortion caused during vector quantization processing for random excitation is mini- 
mized. 

14. The CELP speech coder according to claim 1 3, wherein combinations of dispersion patterns are pre-selected using 
a speech parameter obtained in advance such that the quantization distortion caused during vector quantization 

25 processing for random excitation is minimized, and a combination index showing the combination of the dispersion 
patterns selected by each channel is specified from the pre-selected combinations of the dispersion patterns. 

15. The CELP speech coder according to claim 14, wherein the combination of dispersion patterns to be pre-selected 
is changed in accordance with an analyzing result of a speech segment 

30 

16. The CELP speech coder according to claim 5, further comprising: 

target extracting means for calculating a parameter vector of a speech parameter obtained by analyzing a cur- 
rent coding frame, a parameter vector obtained by analyzing a future frame instead of the coding frame, and a 
35 quantization target vector using an encoded vector of a previous frame instead of the current coding frame; and 

vector-quantizing means for coding the calculated quantized target vector so as to obtain the index of random 
codebook for the current coding frame. 

17. The CELP speech coder according to claim 16, wherein said target extracting means calculates the target vector 
40 based on the following expression: 

x{f) - {S f (/>p(d(/)+s r+1 (oy2}/(i + p) 

where X(i): target vector, 
45 i: vector element number, 

St(i). St +1 (i): parameter vectors, 

t: time (frame number), 

p: weighting coefficient (fixed), and 

d(i): decoded vector of previous frame. 

50 

18. The CELP speech coder according to claim 16, further comprising: 

means for decoding the index of the current coding frame so as to generate a decoded codevector; 
a second distortion calculator for calculating a coding distortion from said decoded vector and a parameter vec- 
55 tor of said coding frame; and 

vector smoothing means for smoothing the parameter vector of the current coding frame to be supplied to said 
target extracting means when said coding distortion is less than a reference value. 
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19. The CELP speech coder according to claim 18, wherein said second distortion calculator calculates a perceptually 
weighted coding distortion based on the following expression: 

Ew = r(V/(/>S f (/) 2 +p{V/(/>(d(/>S f+1 (/)y2} 2 

where Ew: perceptually weighted coding distortion, 

St(i). S, +1 (i): input vector, 

t: time (frame number) 

i: vector element number, 

V(i): decoded vector, 

p: weighting coefficient, and 

d(i): coded vector of previous frame. 

20. The CELP speech coder according to claim 16, wherein said vector-quantizing means comprises: 

a plurality of codebooks, provided to correspond to each stage of a multi-stage vector quantization, storing a 
plurality of codevectors; 

means for calculating a distance between the target vector or its prediction error vector and a codevector 
stored in the codebook of the first stage so as to obtain a code of the first stage; 

an amplifier storing section storing amplitude being expressed by an amount of scalar and corresponding to 
the codevector stored in the codebook of the first stage; 

means for taking out amplitude depending on the code of the first stage from said amplifier storing section 
before a coding of a second stage is carried out so as to multiply the taken-out amplitude by a codevector of 
stored in the codebook of the second stage; and 

means for calculating a distortion between the decoded codevector decoded from the code of the first stage 
and the codevector by which the amplitude stored in the codebook of the second stage is multiplied so as to 
obtain an index of the second stage. 

21 . The CELP speech coder according to claim 5, wherein said CELP speech coder comprises an adaptive codebook 
storing an adaptive codevector expressing a pitch component of the input speech, and said distortion calculator 
comprises: 

means for computing power of a signal obtained by synthesizing said adaptive codevector by said syntheis fil- 
ter and a self-correlation matrix of a filter coefficients forming said synthesis filter so as to calculate a first 
matrix by multiplying each element of said self-correlation matrix by said power; 

means for providing a time reverse synthesis to the signal obtained by synthesizing said adaptive vector by 
said synthetic filter so as to calculate a second matrix by taking an outer product of the signal to which the time 
reverse synthesis is provided; and 

means for generating a third matrix by subtracting said second matrix from said first matrix, thereby calculating 
the distortion. 

22. A CELP speech decoder for decoding speech comprising: 

a random codebook which has the excitation vector generator described in claim 1 , for selecting a dispersion 
pattern in accordance with a random code number specifying a combination index of dispersion patterns and 
a combination index of pulse vectors, and for generating pulse vectors; and 

a synthesis filter for generating a synthetic speech using an excitation vector output from said excitation vector 
generator as a random codevector. 

23. The CELP speech decoder according to claim 22, wherein the dispersion pattern is stored in storing means of said 
excitation vector generator, said dispersion pattern obtaining by pre-training to lessen a quantization distortion 
caused during vector quantizatian processing for random excitations. 

24. The CELP speech decoder according to claim 23, wherein the storing means of said excitation vector generator 
stores at least one kind of dispersion pattern obtained by training every channel. 

25. The CELP speech decoder according to claim 22, wherein at least one kind of dispersion pattern stored in the stor- 
ing means of said excitation vector generator in every channel is a random pattern. 
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26. The CELP speech decoder according to claim 22, wherein at least one kind of dispersion pattern stored in the stor- 
ing means of said excitation vector generator in every channel is a dispersion pattern obtained by pre-training to 
lessen the quantization distortion caused when vector quantization processing for random excitations, and at least 
one kind thereof is a random pattern. 

27. A method for generating an excitation vector comprising the steps of: 

generating pulse vectors of N channels (Nsl); 

selectively taking out a dispersion pattern from storing means storing M (M§1) kinds of dispersion patterns 
every channel in accordance with N channels; 

performing a convolution using the extracted dispersion pattern and the generated pulse vectors every channel 
so as to generate N dispersed vectors; and 

generating an excitation vector from N dispersed vectors generated. 

28. A method for a CELP speech coding system comprising the steps of: 

generating a random codevector for vector quantization processing of random excitation using the excitation 
vector generator described in claim 1 ; 

generating a synthetic speech using an excitation vector output from said excitation vector generator as a ran- 
dom codevector; 

calculating a quantization distortion • caused between the generated synthetic speech and an input speech; 
changing a combination of pulse positions, pulse polarities, and a dispersion pattern constituting a pulse vec- 
tor; and 

specifying the combination of the pulse position, the pulse polarity and the dispersion pattern such that the 
quantization distortion is minimized. 

29. A method for decoding speech signal coded in a CELP system comprising the steps of : 

generating a random code vector using the excitation vector generator described in claim 1 ; and 
generating a synthetic speech using an excitation vector output from said excitation vector generator as a ran- 
dom codevector. 

30. A method for vector quantization processing for an input vector comprising the steps of: 

calculating a target vector from the input vector having a plurality of time-continuous vectors and past-decoded 
vectors; 

coding said target vector to obtain a code and decoding said code to obtain a decoded vector; 
calculating a distortion from the obtained decoded vector and said input vector; 
specifying a code for minimizing said distortion; 
storing the decoded vector; and 

updating the decoded vector by a decoded vector corresponding to a final code. 

31 . A communication apparatus comprising the CELP coder described in claim 5. 

32. A communication apparatus comprising the CELP decoder described in claim 22. 
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FIG. 2 
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